Amendments to the Claims:
This listing of claims replaces all prior versions and listings of claims in the application:

Listing of Claims:

1. (Currently Amended) A computer-implemented method of comparing the 
semantic content of two or more documents, comprising: 

accessing [[two or more]] a plurality of documents;
performing a linguistic analysis on each document; and 

defining a semantic vector for each document based on the linguistic analysis, said 
semantic vector having multiple components, wherein each component of said semantic vector 
has at least: 

a term included in the document or a synonym of said term; 

a weighting factor relating to an importance, based on characteristics of the 
document, of said term; and 

a frequency value relating to a number of occurrences of said term; and
comparing a semantic vector of an identified document to the semantic vector for
each document in the plurality of documents to determine at least one document
semantically similar to the identified document.
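
As an illustration only, the component structure recited in claim 1 (a term, a weighting factor, and a frequency value) can be sketched as below. The claim does not prescribe a tokenizer, a weighting rule, or a similarity measure, so the whitespace tokenization, the "title terms count double" rule, and the names `Component`, `build_semantic_vector`, `weighted_overlap`, and `most_similar` are all assumptions made for this sketch.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Component:
    term: str        # a term included in the document
    weight: float    # importance of the term, based on document characteristics
    frequency: int   # number of occurrences of the term


def build_semantic_vector(title, body):
    """Return {term: Component}; whitespace splitting stands in for linguistic analysis."""
    title_terms = set(title.lower().split())
    counts = Counter(body.lower().split())
    # Assumed weighting rule: a term that also appears in the title counts double.
    return {
        term: Component(term, 2.0 if term in title_terms else 1.0, freq)
        for term, freq in counts.items()
    }


def weighted_overlap(a, b):
    """Assumed similarity: sum over shared terms of min(weight) * min(frequency)."""
    shared = a.keys() & b.keys()
    return sum(
        min(a[t].weight, b[t].weight) * min(a[t].frequency, b[t].frequency)
        for t in shared
    )


def most_similar(identified, others):
    """Return the name in `others` whose vector overlaps most with `identified`."""
    return max(others, key=lambda name: weighted_overlap(identified, others[name]))


query = build_semantic_vector("semantic search", "semantic search compares semantic vectors")
corpus = {
    "a": build_semantic_vector("cooking", "recipes for cooking pasta"),
    "b": build_semantic_vector("vectors", "semantic vectors describe documents"),
}
print(most_similar(query, corpus))
```

Any other weighting rule or similarity measure could be substituted without changing the shape of the components.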

2. (Previously Presented) The method of claim 1, wherein the linguistic analysis
comprises sentence analysis.

3. (Previously Presented) The method of claim 2, wherein the sentence analysis
comprises a syntactic analysis and a semantic analysis.



4. (Canceled)

5. (Canceled)

6. (Previously Presented) The method of claim 1, wherein each component of the 
semantic vector for at least one of the documents comprises multiple dimensions. 

7. (Previously Presented) The method of claim 1, wherein each component of the 
semantic vector for at least one of the documents further comprises a subordinate concept value. 

8. (Canceled) 

9. (Canceled) 

10. (Previously Presented) The method of claim 1, wherein some of the components 
of the semantic vector for at least one of the documents have main term - subordinate term pairs 
as their first value. 

11. (Previously Presented) The method of claim 1, wherein the semantic vector
comprises a multi-dimensional vector defined by the content of a semantic net.

12. (Previously Presented) The method of claim 11, wherein the content of the
semantic net is augmented by relative weights, strengths, or frequencies of occurrence of the
features within the semantic net.

13. (Previously Presented) The method of claim 37, wherein an output of said
defined algorithm is a measure of at least one of semantic distance, semantic similarity, semantic 
dissimilarity, degree of patentable novelty and degree of anticipation. 

14. (Currently Amended) A computer-implemented method of comparing two or
more documents, comprising:

linguistically analyzing [[two or more]] a plurality of documents to identify at least one
term group in each document, each term group comprising a main term and at least one 
subordinate term semantically related to the main term; 

generating a semantic vector associated with each document, the semantic vector 
comprising a plurality of components, each component including: 
a term group in the document; 

a frequency value relating to a number of occurrences of the term group; and 
a weighting factor relating to an importance, based on characteristics of the 
document, of at least part of the term group; and 

comparing [[the semantic vectors]] a semantic vector of an identified document to the
semantic vector for each document in the plurality of documents to determine at least one
document semantically similar to the identified document using a defined metric, wherein
said metric measures the semantic distance between two documents as a function of at least the
frequency values included in the semantic vectors for the two documents.
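
For illustration, the term-group components of claim 14 might be modeled as follows. This is a sketch under several assumptions: each term group is simplified to one main term and a single subordinate term (the claim allows more), the (main, subordinate) pairs are taken as already produced by some linguistic analysis, and the names `TermGroup`, `GroupComponent`, `vector_from_pairs`, and `frequency_distance`, along with the mean-absolute-difference metric, are hypothetical choices that the claim does not specify.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class TermGroup:
    main: str          # main term
    subordinate: str   # one subordinate term semantically related to the main term


@dataclass(frozen=True)
class GroupComponent:
    group: TermGroup
    frequency: int     # occurrences of the term group in the document
    weight: float      # importance of at least part of the term group


def vector_from_pairs(pairs, important_terms=frozenset()):
    """Build {TermGroup: GroupComponent} from (main, subordinate) pairs."""
    counts = Counter(TermGroup(main, sub) for main, sub in pairs)
    # Assumed weighting rule: groups whose main term is flagged important count double.
    return {
        group: GroupComponent(group, n, 2.0 if group.main in important_terms else 1.0)
        for group, n in counts.items()
    }


def frequency_distance(a, b):
    """Assumed metric: mean absolute frequency difference over shared term groups."""
    shared = a.keys() & b.keys()
    if not shared:
        return float("inf")  # nothing in common: treat as maximally distant
    return sum(abs(a[g].frequency - b[g].frequency) for g in shared) / len(shared)


doc_a = vector_from_pairs(
    [("engine", "diesel"), ("engine", "diesel"), ("wheel", "alloy")],
    important_terms={"engine"},
)
doc_b = vector_from_pairs([("engine", "diesel"), ("wheel", "alloy")])
print(frequency_distance(doc_a, doc_b))
```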

15. (Canceled) 

16. (Previously Presented) The method of claim 14, wherein the main term includes 
synonyms of the main term. 

17. (Previously Presented) The method of claim 14, wherein one or more of said two 
or more documents are located using an autonomous software or 'bot program. 

18. (Previously Presented) The method of claim 17, wherein the 'bot program 
automatically analyzes each document in a defined domain or network by executing a series of 
rules and assigning an overall score to the document. 

19. (Previously Presented) The method of claim 18, wherein all documents with a
score above a defined threshold are linguistically analyzed.
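
The rule-based scoring and threshold step of claims 18 and 19 could look roughly like the sketch below. The claims do not say what the rules are or how scores combine, so the additive scoring, the two example rules, and the names `score_document` and `select_for_analysis` are assumptions for illustration.

```python
def score_document(text, rules):
    """Apply each (predicate, points) rule and sum the points of the rules that match."""
    return sum(points for predicate, points in rules if predicate(text))


def select_for_analysis(documents, rules, threshold):
    """Return names of documents whose overall score is above the threshold."""
    return [
        name
        for name, text in documents.items()
        if score_document(text, rules) > threshold
    ]


# Hypothetical rules: a keyword check and a minimum-length check.
rules = [
    (lambda text: "patent" in text.lower(), 2),
    (lambda text: len(text.split()) >= 5, 1),
]
documents = {
    "a": "Patent claims describe the invention clearly",
    "b": "short note",
}
print(select_for_analysis(documents, rules, threshold=2))
```

Only the documents returned by `select_for_analysis` would then be passed on to the linguistic analysis.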

20. (Previously Presented) The method of claim 14, wherein the semantic vector is a 
quantification of the semantic content of each document. 

21. (Previously Presented) The method of claim 14, wherein each component has
multiple dimensions. 

22. (Canceled) 

23. (Currently Amended) A system for comparing two or more documents, 
comprising: 

a document inputter, arranged to access [[two or more]] a plurality of documents;

a semantic analyzer, arranged to perform a linguistic analysis on each document to 
identify at least one term group in the document, each term group comprising a main term and at 
least one subordinate term semantically related to the main term; 

a semantic quantifier, arranged to output a quantified representation of a semantic content 
of each document, the quantified representation based at least in part on: 
a term group in the document; and 

a weighting factor relating to an importance, based on characteristics of the 
document, of at least part of the term group; and 

a comparator, arranged to compare the quantified representations using a defined 
algorithm, wherein said defined algorithm measures the semantic distance between two 
documents as a function of at least the weighting factors associated with the quantified 
representations for the two documents to determine at least one document in the plurality of 
documents semantically similar to an identified document.

24. (Currently Amended) A system for comparing two or more documents,
comprising:

a document inputter, arranged to access [[two or more]] a plurality of documents;

a semantic analyzer, arranged to perform a linguistic analysis on each document to 
identify at least one term group in the document, each term group comprising a main term and at 
least one subordinate term semantically related to the main term; 

a semantic vector generator, arranged to output a semantic vector associated with each 
document, each semantic vector comprising a plurality of components, each component 
including: 

a term group in the document; 

a frequency value relating to a number of occurrences of the term group; and 
a weighting factor relating to an importance, based on characteristics of the 
document, of at least part of the term group; and 

a comparator, arranged to compare the semantic vectors using a defined metric, wherein
said metric measures the semantic distance between [[two]] documents as a function of at least the
frequency values included in the semantic vectors for the two documents to determine at least
one document in the plurality of documents semantically similar to an identified document.

25. (Canceled) 

26. (Currently Amended) A computer program product comprising a computer
usable medium having computer readable program code means embodied therein, the computer
readable program code means in said computer program product comprising means for causing a
computer to:

access [[two or more]] a plurality of documents;

perform a linguistic analysis on each document to identify at least one term group in the 
document, each term group comprising a main term and at least one subordinate term 
semantically related to the main term; 

output a quantified representation of a semantic content of each document, the quantified 
representation based at least in part on: 

a term group in the document; 

a frequency value relating to a number of occurrences of the term group; and 
a weighting factor relating to an importance, based on characteristics of the 
document, of at least part of the term group; and 

compare the quantified representations using a defined algorithm, wherein said defined
metric measures the semantic distance between [[two]] documents as a function of at least the
frequency values associated with the quantified representations for the two documents to
determine at least one document in the plurality of documents semantically similar to an
identified document.

27. (Currently Amended) A computer program product comprising a computer
usable medium having computer readable program code means embodied therein, the computer
readable program code means in said computer program product comprising means for causing a
computer to:

linguistically analyze [[two or more]] a plurality of documents to identify at least one term
group in the document, each term group comprising a main term and at least one subordinate 
term semantically related to the main term; 

generate a semantic vector associated with each document, each semantic vector 
comprising a plurality of components, each component including: 
a term group in the document; and 

a weighting factor relating to an importance, based on characteristics of the 
document, of at least part of the term group; and 

compare the semantic vectors using a defined metric, wherein said metric measures the 
semantic distance between two semantic vectors as a function of at least the weighting factors 
included in the two semantic vectors to determine at least one document in the plurality of 
documents semantically similar to an identified document.

28. (Previously Presented) The computer program product of claim 27, wherein the 
computer readable program code means in said computer program product further comprises 
means for causing a computer to: 

identify one or more of said two or more documents using an autonomous software or
'bot program.

29. (Previously Presented) The computer program product of claim 28, wherein said 
'bot program automatically analyzes each document in a defined domain or network by 
executing a series of rules and assigning an overall score to the document. 

30. (Previously Presented) The computer program product of claim 27, wherein the
semantic vector is a quantification of the semantic content of each document.

31. (Previously Presented) The computer program product of claim 27, wherein an
output of said defined metric is a measure of at least one of semantic distance, semantic 
similarity, semantic dissimilarity, degree of patentable novelty and degree of anticipation. 

32. (Canceled) 

33. (Previously Presented) A system for comparing two or more documents, 
comprising: 

a document inputter, arranged to access two or more documents; 
a semantic analyzer, arranged to perform a linguistic analysis on each document; 
a semantic vector generator, arranged to output a semantic vector associated with each 
document; and 

a comparator, arranged to compare the semantic vectors using a defined metric, wherein 
said defined metric is one of: 

[Sqrt(f1^2 + f2^2 + f3^2 + f4^2 + ... + f(N-1)^2 + fN^2)/n] * 100, wherein f is a difference in
frequency of a common term between two documents and n is the number of terms those 
documents have in common; or 

Sqrt(sum((w-Delta)^2 * w-Avg))/(Log(n)^3 * 1000), wherein w-Delta is the 
difference in weight between two common terms, w-Avg is the average weight between two 
common terms, and n is the number of common terms, between two documents. 
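
A minimal sketch of the two metrics recited in claim 33 follows. It reads the first expression as [Sqrt(f1^2 + f2^2 + ... + fN^2)/n] * 100, with the division by n applied outside the square root as the bracketing suggests. The claim does not state the base of Log, so the natural logarithm is assumed here. The dictionary inputs and the names `frequency_metric` and `weight_metric` are hypothetical. Note that the second expression is undefined when n is 1, because Log(1) is zero.

```python
import math


def frequency_metric(freq_a, freq_b):
    """[Sqrt(sum of squared frequency differences over common terms) / n] * 100."""
    common = freq_a.keys() & freq_b.keys()
    n = len(common)
    if n == 0:
        raise ValueError("documents share no terms")
    sum_sq = sum((freq_a[t] - freq_b[t]) ** 2 for t in common)
    return math.sqrt(sum_sq) / n * 100


def weight_metric(weight_a, weight_b):
    """Sqrt(sum((w-Delta)^2 * w-Avg)) / (Log(n)^3 * 1000), natural log assumed."""
    common = weight_a.keys() & weight_b.keys()
    n = len(common)
    if n < 2:
        raise ValueError("need at least two common terms, since Log(1) is zero")
    total = 0.0
    for t in common:
        delta = weight_a[t] - weight_b[t]        # w-Delta for this common term
        avg = (weight_a[t] + weight_b[t]) / 2    # w-Avg for this common term
        total += delta ** 2 * avg
    return math.sqrt(total) / (math.log(n) ** 3 * 1000)


# Common terms are "a" and "b"; frequency differences are 3 and -4.
print(frequency_metric({"a": 5, "b": 1, "c": 9}, {"a": 2, "b": 5, "d": 1}))
print(weight_metric({"a": 1.0, "b": 2.0}, {"a": 1.0, "b": 2.0}))
```

Under both expressions, identical values for every common term give a result of zero, and larger differences give larger results.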

34. (Currently Amended) A computer-implemented method of comparing two or
more documents, comprising:

linguistically analyzing [[two or more]] a plurality of documents;

generating a semantic vector associated with each document; and 

comparing the semantic vectors using a defined metric, wherein said defined metric is
one of:

[Sqrt(f1^2 + f2^2 + f3^2 + f4^2 + ... + f(N-1)^2 + fN^2)/n] * 100, wherein f is a difference in
frequency of a common term between [[two]] documents and n is the number of terms those
documents have in common; or

Sqrt(sum((w-Delta)^2 * w-Avg))/(Log(n)^3 * 1000), wherein w-Delta is the
difference in weight between two common terms, w-Avg is the average weight between two
common terms, and n is the number of common terms, between [[two]] documents to determine at
least one document in the plurality of documents semantically similar to an identified
document.

35. (Previously Presented) A computer program product comprising a computer 
usable medium having computer readable program code means embodied therein, the computer 
readable program code means in said computer program product comprising means for causing a 
computer to: 

access two or more documents; 

perform a linguistic analysis on each document; 

output a quantified representation of a semantic content of each document; and 
compare the quantified representations using a defined algorithm, wherein said defined 
algorithm is one of: 

[Sqrt(f1^2 + f2^2 + f3^2 + f4^2 + ... + f(N-1)^2 + fN^2)/n] * 100, wherein f is a difference in
frequency of a common term between two documents and n is the number of terms those 
documents have in common; or 

Sqrt(sum((w-Delta)^2 * w-Avg))/(Log(n)^3 * 1000), wherein w-Delta is the
difference in weight between two common terms, w-Avg is the average weight between two
common terms, and n is the number of common terms, between two documents. 

36. (Previously Presented) The method of claim 1, wherein said term comprises at
least one of a word or a phrase. 

37. (Previously Presented) The method of claim 1, further comprising comparing the 
semantic vectors based on a defined algorithm. 

38. (Previously Presented) The method of claim 14, wherein the at least one 
subordinate term includes synonyms of one of the subordinate terms. 

39. (Previously Presented) The method of claim 14, wherein one or more of the at 
least one subordinate term or the main term comprises a phrase. 

40. (Previously Presented) The method of claim 14, wherein the weighting factor 
comprises a plurality of different weighting factors and each of the different weighting factors 
relates to the importance of the main term or a subordinate term in the term group. 



